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Abstract. We provide deviation inequalities for properly normalized sums of bifurcating 
Markov chains on Galton-Watson tree. These processes are extension of bifurcating Markov 
chains (which was introduced by Guyon to detect cellular aging from cell lineage) in case the 
index set is a binary Galton-Watson process. As application, we derive deviation inequalities 
for the least-squares estimator of autoregressive parameters of bifurcating autoregressive 
processes with missing data. These processes allow, in case of cell division, to take into 
account the cell's death. The results are obtained under an uniform geometric ergodicity 
assumption of an embedded Markov chain. 
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1. Introduction 

Bifurcating Markov chains (BMC) on Galton-Watson (GW) tree are an extension of BMC 
to GW tree data. They were introduced by Delmas and Marsalle [12j in order to take into 
account the death of individuals in the Escherichia coli's (E.coli) reproduction model. E.coli 
is a rod-shaped bacterium which reproduces by dividing in the middle, thus producing two 
cells. One which has the new pole of the mother and that we call new pole progeny cell, and 
the other which has the old pole of the mother and that we call old pole progeny cell. In fact, 
each daughter cell has two poles. One which is new (new pole) and the other which already 
existed (old pole). The age of a cell is given by the age of its old pole (i.e the number of 
generations in the past of the cell before the old pole was produced). 

Guyon & Al |15| proposed the following linear Gaussian model to describe the evolution of 
the growth rate of the population of cells derived from an initial individual: 

{X2n = OtoX n + /3o + S2n 
(1.1) 
X2n+l = Ct\X n + Pi + £2n+l; 

where X n is the growth rate of individual n, n is the mother of 2n (the new pole progeny cell) 
and 2n + 1 (the old pole progeny cell), v is a distribution probability on M, a^,ai £ (—1, 1); 
/3o,/?i € M and ((£2n> £2n+i), n > l) forms a sequence of i.i.d bivariate random variables with 
law A/2(0, r), where 

T = ° 2 (l P X ct2 >°> pe(-l,l). (1-2) 

The processes (X n ) defined by (jl.ip are typical example of BMC which are called the first 
order bifurcating autoregressive processes (BAR(l)). The BAR(l) processes are an adaptation 
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of autoregressive processes, when the data have a binary tree structure (see Figure 1). They 
were first introduced by Cowan and Staudte [9j for cell lineage data where each individual in 
one generation gives rise to two offspring in the next generation. 

In |14j . Guyon, using the theory of BMC, gave laws of large numbers and central limit 
theorem for the least-squares estimator 9 r = (Sg, /3q, a[, /3{) of the 4-dimensional parameter 
= (aoi A)> a i> He has also built some statistical tests which allow to test if the model 
is symmetric or not, and if the new pole and the old pole populations are even distinct in 
mean. This allowed him to conclude a statistical evidence in aging in E. Coli. Let us also 
mention [6j, where Bercu & Al. using the martingale approach give asymptotic analysis of the 
least squares estimator of the unknown parameters of a general asymmetric pth-order BAR 
processes. 

However, in the BMC model presented by Guyon, cells are assumed to never die (a death 
corresponds to no more division). To take into account cells's death, Delmas and Marsalle 
[12] . instead of a regular binary tree, used a binary GW tree to label cells. In the sequel, 
we will introduce the model which allowed them to study the behavior of the growth rate of 
cells, taking into account their possible death. 

1.1. The model. Let T be a binary regular tree in which each vertex is seen as a positive 
integer different from 0, see Figure [TJ For r £ N, let 

r 

G r = {2 r , 2 r + 1, • • • , 2 r+1 - l}, T r = |J G 9 , 

g=0 

which denote respectively the r-th column and the first (r + 1) columns of the tree. Then, 
the cardinality |G r | of G r is 2 r and that of T r is |T r | = 2 r+1 — 1. A column of a given integer 
n is G rn with r n = [log 2 n\ , where \x\ denotes the integer part of the real number x. 
The genealogy of the cells is described by this tree. In the sequel we will thus see T as a given 
population. Then the vertex n, the column G r and the first (r + 1) columns T r designate 
respectively individual n, the r-th generation and the first (r + 1) generations. The initial 
individual is denoted 1. The model proposed by Delmas and Marsalle [12] is defined as follows. 
The growth rate of cell n is X n . 

• With probability pio, n gives birth to two cells 2n and 2n + 1 with both divide. The 
growth rate of the daughters X 2n and X 2n +\ are then linked to the mother's one 
through auto-regressive equations (jl.ip . 

• With probability po, only the new pole 2n divides. Its growth rate X 2n is linked to 
its mother's one X n through the relation 

X 2n = a' X n +P' + e' 2n , (1.3) 

where a' £ ( — 1,1), f3' Q £ K and {s' 2n ,n G T) is a sequence of independent centered 
Gaussian random variables with variance <Tq > 0. 

• With probability pi, only the old pole 2n + 1 divides. Its growth rate X 2n +\ is linked 
to its mother's one X n through the relation 

X 2n+l = a[X n + (3[+ e' 2n+1 , (1.4) 

where £ (— 1, 1), f3[ G R and (£ 2n+ i,n £ T) is a sequence of independent centered 
Gaussian random variables with variance a\ > 0. 
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FIGURE 1 . The binary tree T 



• With probability 1 — pi ; o — p\ — Po, which is non-negative, n gives birth to two cells 
which do not divide. 

• The sequences ((s2n, £2n+i), n G T), (s' 2n ,n £ T) and (£2n+i' n ^ ^) are independent. 

The process (X n ) described above is a typical example of BMC on GW tree. In this 
process is called bifurcating autoregressive process (BAR) with missing data. It is an extension 
of bifurcating autoregressive process when the data have a binary GW tree structure, see figure 
2 for example of binary GW tree. Indeed, one can assume that the cells which do not divide 
and those which do not exist are missing or dead. 

In |12| . Delmas and Marsalle using their results for BMC on GW tree, gave laws of large 
numbers and central limit theorem for the maximum likelihood estimator of the parameter 

9 = (ao,/3o,ai,/3i,ao,/?o,a'i,/30- ( L5 ) 

In this paper, we will give deviation inequalities for the least squares estimator of the pa- 
rameter 6, in case the noise sequence and the initial state X\ take their values in a compact 
set. Note that this implies that the BAR process with missing data describes above also 
take their values in compact set. These deviation inequalities are important for a rigorous 
non asymptotic statistical study. Indeed, when the sample size is insufficient to apply limit 
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theorems, they allow for example to estimate the errors in the estimation of unknown param- 
eters. Furthermore, these inequalities allow to get a rate of convergence in the laws of large 
numbers, and this permit, for example, to build non-asymptotic confidence intervals. 
We are now going to give a rigorous definition of BMC on GW tree. We refer to |12| for more 
details. 

1.2. Definitions. For an individual n G T, we are interested in the quantity X n (it may be 
the weight, the growth rate,- • • ) with values in the metric space S endowed with its Borcl 
cr-field S. 

Definition 1.1 (T-transition probability, see (|14j)). We call T '-transition probability any 
mappings P : S x 5 2 — >■ [0, 1] such that 

• P(.,A) is measurable for all A £ S 2 , 

• P(x, .) is a probability measure on (S 2 ,S 2 ) for all x 6 S. 

For p > 1, we denote by i3(5 p )(resp. Bb(S p ), C(S P ), Cb(S p )) the set of all 5 p -measurable (resp. 
5 p -measurable and bounded, continuous, continuous and bounded) mapping / : S p — > R. For 
/ 6 £>(S* 3 ), when it is defined, we denote by Pf G B(S) the function 



x\->Pf(x)= f(x,y,z)P(x,dy,dz). 
Js 2 



Definition 1.2 (Bifurcating Markov Chains, see (|14j)). Let (X n ,n G T) be a family of S- 
valued random variables defined on a filtered probability space (Q, J 7 , (J>, r G N),P). Let v 
be a probability on (S,S) and P be a T-transition probability. We say that (X n ,n G T) is a 
(J>) -bifurcating Markov chain with initial distribution v and T-transition probability P if 

• X n is T Tn -measurable for all n G T, 

• C{X X ) = v, 

• for all r G N and for all family (f n ,n G G r ) C Bb(S 3 ) 



E 



W fn(X n , X2n, X2n+1, 
,nGG r 



ngG r 



Now, we add a cemetery point to S, d. Let S = S U {d}, and S be the a— field generated 
by S and {d}. In the previous biological framework, S corresponds to the state space of 
the quantities related to living cells, and d is the default value for dead cells. Let P* be a 
T-transition probability defined on S x S such that 

P*(d,{(d,d)}) = l. (1.6) 

In the previous biological framework, (|1.6p means that no dead cell can give birth to a living 
cell. We denote by Pq and Pj* the restriction of the first and the second marginal of P* to S, 
that is: 

P£ = P*(;(-(~)s)xS) and p; = P* (•, S X (• f| S 1 )) . 

Definition 1.3 (BMC on GW tree, see [12]). Let X = (X n , n G T) be a P*-BMC on (S,S), 
with P* satisfying < TO)) . We call (X n , n G T*), with T* = {n G T : X n ^ d}, a BMC on GW 
tree. The P* -BMC is said spatially homogeneous if p\ t o = P*(x, S x S), po = P*(x, S x {d}), 
and pi = P*(x,{d} x S) do not depend on x G S. A spatially homogeneous P* -BMC is said 
super-critical if m > 1, where m = 2p\ q +pi +Po- 
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We denote by (Y n ,n E N) the Markov chain on S with Yq = X\ and transition probability 

Remark 1.4. • The name BMC on GW tree comes from the fact that condition (|1.6j) 

and spatial homogeneity imply that T* is a GW tree. 
• All through this work, we shall assume that the P*-BMC is super-critical. 




Figure 2. A binary GW tree up to the 4 th generation. In this tree, in- 
dividual 1 gives birth to two individuals which both divide, this happen with 
probability p\$. Individual 2 gives birth to two individuals which only one (the 
new pole) divides, this happen which probability p$. Individual 12 gives birth 
to two individuals which do not divide, this happen with probability 1 — p\ o — 
Po -Vl- 

Now, for any subset J C T, let 

J* = JnT* = {j E J : Xj ^ d} 

be the subset of living cells among J, and |J| be the cardinal of J. The process (\GZ |, k E N) , 
is a GW process with the reproduction generating function 

lj}{z) = (1 - po - pi - Pl j0 ) + (PO + Pl)Z + PlflZ 2 , 

and the average number of daughters alive is m. It is known, see e.g [3j, that m _fc |G^| 
converges in probability to a non-negative random variable W. Moreover, ¥(W > 0) = 1 iff 
there is no extinction. We have for all r > 0, 

E[\G* r \]=m r and E [|T*[] = ^ E [\G* q \] = ^ - 1 := t r . (1.7) 

z — ' H m — 1 

q=0 
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It is known, see [12] . that 3t~ x | TT* | converges in probability to W as well. 

For i E T, set Aj = (Xi,X2i,X2i+i) the mother-daughters quantities of interest. For a finite 
subset J C T, we set 

'E/(*0 tor feB(S), 



MjU) 



E/(A,) for/G^(5 3 ), 



with the convention that a sum over an empty set is null. We also define the following two 
averages of / over J 

Mj(f) = ±-Mj{f) if|J|>0 and Mj(f) = ^l^Mj(f) ifE[|J|]>0. (1.9) 

Limit theorems for averages ([1.9p have been studied in |12j for J = G* and J = T*, as n 
goes to infinity. Under uniform geometric ergodicity assumption for Q, we will establish in 
this paper deviation inequalities for those averages. These deviation inequalities will allow 
to highlight three regimes for the speed of convergence of above averages, thus showing a 
competition between the ergodicity of the embedded Markov chain (Y n ,n G N) and the size 
of the binary Galton- Watson tree. This new phenomenon is not observed in the asymptotic 
study of Delmas and Marsalle [12J. Notice that deviation inequalities were already studied in 
the no death case [8], that is m = 2. We will follow essentially the same approach that the 
latter paper for the proofs of our results. However, we will introduce some modifications on 
those proofs in order to take into account the randomness of index set, and we will make use 
of the theory of large deviation for branching processes [2] . Let us also mention [7J , where the 
authors establish deviation inequalities for estimators of parameters of the p-order bifurcating 
autoregressive process. 

The rest of paper is organized as follows. In section [2j we states our main results, that is 
deviation inequalities for averages (|1 .9|) . for J = G* and J = T*. This will be done under 
uniform geometric ergodicity assumption for Q, and suitable assumptions on the binary GW 
tree. In section [3l we will focus in particular on the first order bifurcating autoregressive 
process with missing data described in section [L~T1 Section [4] is dedicated to the proofs of our 
results. 



2. Main results 



We consider the following hypothesis: 

(HI): There exists a probability measure /j, on (S,S) such that for all / G Bb(S) with 

/) = 0, there is c > such that for all k G N and for all x G 5, \Q k f(x)\ < ca k . 
(H2): m > y/2. 

(H3): pi t Q + po + pi = 1, where pi t o, po and p\ are defined in section [TTTl 

Remark 2.1. Hypothesis (HI) implies that the Markov chain Y is ergodic, that is for all 
/ G Cf)(S) and for all x G S, lim E^f/^Yfc)] = (//,/). Assuming hypothesis (H3) means that 

k— ¥00 

we work conditionally to the non-extinction. Note that this is consistent with the study of E. 
Coli. 

Hypothesis (H2) comes from our calculations, indeed, in order to get relevant inequalities, 
i.e. inequalities for which the upper bound goes to zero as the sample size increases, we have 
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to assume that m > y/2. However, our deviation inequalities also work for m < y/2, but they 
are not relevant for this case. To get relevant deviation inequalities for m < \[2 is still an 
open problem that we will pursue in an other work. 

In the sequel, HL- will denote one of the set G r or T r . We set h r = (m 2 /2) r if H r = G r and 
h r = (m 2 /2) r+l if H r = T r . We can now state our main results. Notice that any function / 
defined on S is extended to S by setting f(d) = 0. 

Theorem 2.2. Under hypothesis (HI) and (H2), let f G Bb(S) such that (fi, f) = 0. Then 
we have for all 5 > 0: 

• if ma < 1, then Vr G N, 

p(m h *(/) > 5) < exp (c"6) exp (-c'5 2 h r ) ; 

• if ma = 1, then for M r = G r and Vr G N, 

p(m h *(/) > 5) < exp (c"5) exp (-c'5 2 (m 2 /2) r ) ; 

• if ma = 1, then for H r = T r and Vr G N, 

P(M H *(/) > a) < exp (c"<5(r + 1)) exp {-c'5 2 (m 2 /2) r+1 ) ; 

• if 1 < ma < \/2, i/ien Vr € N such that r > r$, 

p(m h; (/) ><j) < exp ; 

• if ma = \[2, then Vr G N suc/i i/iai r > ro, 

P(M H; (/)X5) <ex P (-^); 

• if ma > \[2, then Vr € N* suc/i that r > ro, 

p(Mh*(/)><5) <exp ("^); 

where, 

• r := log (j^j /log(a) - fc , wi/i fc G {0, 1}, 

• Co, c' and c" are positive constants which depend on a, m, and c and may differ line 
by line. 

Theorem 2.3. Under hypothesis (H1)-(H3), we have for all f G Bb{S) such that (fi, f) / 
and for all 5 > 0: 

• if ma < 1, then Vr G N, 

p(m H; (/) - (/x, f)W >5)< exp (c"<5) exp (-c'5 2 h r ) + A r ; 

• if ma = 1, then for M r = G r and Vr G N, 

p(Mh*(/) - ( M , /)W > 5) < exp (c"<5) exp (-c'5 2 (m 2 /2f) + A r ; 
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if ma = 1, then for M r = T r and Vr G N, 

p(m h? (/) " (M, />W > S) < exp (c"<5(r + 1)) exp (-cV(m 2 /2) r+1 ) + A r 
if 1 < ma < v2, i/jen Vr E N suc/i that r > ro, 

p(m h *(/) - (M./>^ > 5 ) < ^p {-c'5 2 h r ) + A r ; 
if ma = y/2, then Vr G N suc/i i/iai r > tq, 

M H ,(/) - (Ai,/)^ > 5) < exp f-^L) +Ar ; 
if ma > V% then Vr G N* such that r > r$, 

'M m *(f)-(fi,f)W>6\ <exp 



c'<5 2 



a 



2r 



where, 



for all r 6 N, 



'c / exp(-c"<5 2 / 3 (m 1 /3)-) 
exp (c'o" 2 / 3 ) exp (-c"«5 2 / 3 (i r /(r + 1) 2 ) 1/3 



r o : = lo s ( Jr ) / lo g(«) - fc o, g {o, i}, 



+ A r ; 



if I 
i/I 



TV 



• Co, c' and c" are positive constants which depend on a, m, and c and may differ line 
by line. 

Remark 2.4. For (^u, /) = (Theorem I2.2() . there is no additional term in the deviation of 
average Mh*(/). While in Theorem 12.31 there is an additional term A r which appears. This 
term is a contribution of the binary Galton- Watson tree on the deviation of average Me* (/) 
with respect to (fj,,f)W. This explain why we need additional hypothesis (H3) in Theorem 
2.3| because we have to deal with the deviation inequalities for Galton- Watson processes. 



The next results can be seen as a consequence of the previous results. 

Theorem 2.5. We assume that hypothesis (H1)-(H3) are satisfied. Let f E Bb(S). For all 
5 > 0, for all a > and for all b > such that b < a/(S + 1), we have 

• if ma < 1, then Vr G N, 

p(Mh*(/) - (M, /) > S\\V > a) < exp (c"56) exp {-c'{5b) 2 h r ) + A; 

• if ma = 1, i/ien /or H r = G r and Vr € N, 

p(m h? (/) - (M, /) > «5 W > a) < exp (c"Sb) exp (-c'(56) 2 (m 2 /2) r ) + A r ; 

• if ma = 1, i/ien /or H r = T r and Vr G N, 
- (A*,/) > > a) < exp (c"56(r + 1)) exp (-c'(56) 2 (m 2 /2) r+1 ) + A r ; 

if 1 < ma < \/2; i/ien Vr G N such that r > ro, 

p(Mh;(/) - (M,/> > > a ) < exp (-cWM + A r ; 
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• if ma = y/2, then Vr G N such that r > vq, 
'M m *(f)-(HJ)>6 



, , c'(<«>) 2 /i r . 
W > a < exp i— i — - + A- 



if ma > \[2, then Vr G N* such that r > r$, 

p(Mh*(/) - (m,/> > <^ > a) < exp ("^^) + A- 5 



where, 

• for all r G 




c' exp (-c"(<56) 2 / 3 (m 1 /3)r) j/ e r = G r 

exp (c'(<56) 2 / 3 ) exp (-c"(«>) 2/3 (i r /(r + 1) 2 ) 1/3 ) ifU r = T r 



• r := log (^J /log(a) - k , with k G {0, 1}, 

• Co, d and c" are positive constants which depend on a, m, a, and c, and may differ 
line by line. 

We have the following extension of above theorems when / does not only depend on an 
individual Xj, but on the mother-daughters triangle Aj. 

Theorem 2.6. Let f G B b (S 3 ). If {fi,P*f) = 0, then, under hypothesis (HI) and (H2), 

we have deviation inequalities of Theorem \2. 2\ for Mh*(/). If (/j,,P*f) ^ 0, under additional 
hypothesis (H3), we have deviation inequalities of Theorem \2.3\ for Me*(/) — (/i, P* f)W and 
of Theorem\M for M M *(f P*f). 

Remark 2.7. Let us stress that by tedious, but straightforward calculations, the constants 
which appear in the previous inequalities can be made explicit. 

Let us recall the following definition. 

Definition 2.8. Let (E,d) be a metric space. Let (Z n ) be a sequence of random variables 
valued in E, Z be a random variable valued in E and (v n ) be a rate. We say that Z n converges 
v n -superexponentially fast in probability to Z if for all 5 > 0, 

limsup — log¥(d(Z n ,Z) > 6) = -oo. 

This "exponential convergence" with speed v n will be shortened as 

p. su percx p ^ 
n v„ 

Remark 2.9. Let (b n ) be a sequence of increasing positive real numbers such that 

b n ->■ +oo 

and 

• if ma < \/2, the sequence (b n ) is such that b n j\fn — > 0, 

• if ma = \/2, the sequence (b n ) is such that (b n \/ log n) / y/n — > 0, 

• if ma > y/2, the sequence (6 n ) is such that b n a logn/l ° s{m2/2) — >■ 0. 



10 



S.V. BITSEKI PENDA 



Prom the previous deviations inequalities, we can deduce easily that 

M H; (/) SU ^ XP if(A*,/) = 0, 
for. , 

lh r ] 

and if (//, /) 7^ 0, we have for m < 2 3 / 5 

LhrJ 

and Va > 0, 

limsup^-logP(|MH»(/)-(M,/>| >*|W>a) =-00. 

So, for the exponential convergence of averages Mh*(/) and Mh*(/), there are three regimes 
according to the value of ma compared to y2. This phenomenon is not observed in the limit 
theorems of Delmas and Marsalle [12J. However, a similar phenomenon was observed recently 
by Adamczak and Milos for the central limit theorem of branching particle system [lj. 

So, our deviations inequalities highlight a competition between the ergodicity of the embedded 
Markov chain with transition probability Q and the Galton- Watson binary tree. 

3. Application:First order bifurcating autoregressive processes with missing 

DATA 

We consider the asymmetric auto-regressive processes given in section 11.11 Notice that the 
process {X{ ,i £ T) defined in section 11.11 with the convention that X% = 8 if the cell i is 
missing, is a spatially homogeneous BMC on a GW tree. We will assume that 2p\ q +p\ +po > 
\/2. This implies in particular that the BMC on GW is super-critical. We will also assume 
that the noise sequences ((£2nj £ 2n+l)> n G T), (s'2 n ,n G T) and (£2n+D n ^ ana ^ the initial 
state X\ take their values in a compact set. The latter implies that the process [X%,i G T) is 
bounded. We denote by S the state space of (Xi,i G T). We assume without loss of generality 
that S is a compact subset of R. 

Let T n ' 1 be the subset of cells in T* with two living daughters, (resp.T* ) be the set of cells 
of T* with only the new (resp. old) pole daughter alive: 

= {i G T* : Ai G S 3 } , T° n = {i G T* : A, G S 2 x {8}} and 
= {t G T* : A, G S x {8} x S}. 
We compute the least-squares estimator (LSE) 

7> C^ n /9 n ■^n "an Cr'n o'n ->'n a'n\ 

of 9 given by (|1.5p . based on the observation of a sub-tree T* +1 . Consequently, we obviously 
have for 7/ G {0, 1}, 

l^n' !" 1 E X i X 2i+V~ I l^n !" 1 E X{ I I l T n'°l~ 1 E X2i +V 
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Prj — 1 E ^2i+r? - S™|T^'°| 1 E X, 




mr 1 e XiX 2i+ri - [ \ti\- 1 I ra _1 E 

a r? = : 

ra-'E^- (mr'E^ 
ft? = rer 1 E - s^i^r 1 e 

Notice that those LSE are based on polynomial functions of the observations. So, since the 
latter are bounded, we are in the functional setting of the results of section [2j Recalling 
the Markov chain (Y n ,n G N), notice that Y n is distributed as Z n = aia 2 • • • a n _ia n Io + 
Ysk=i a i°2 ' ' " Ofc-i&fc, where b n = b' n + s n e n , ((a n , b' n , s n ),n > 1) is a sequence of independent 
identically distributed random variables, whose common distribution is given by, for r\ G 
{0,1}, 

P(oi = a v , b\ = f3„, si = a) = and P(ai = a' b\ = (3' s\ = an) = — , 

m 1 1 m 

(e n ,n > 1) is a sequence of independent A/"(0, 1) random variables, and is independent of 

((a n ,b' n , s n ),n > 1), and both sequences are independent of Yq. Moreover, it is easy to 

check that the sequence (Z n ,n G N) converge a.s. to a limit Z, which implies that the 

Markov chain (Y n ,n G N) converge in distribution to Z. We refer to |12j . section 6, for more 

details. Following the proof of Proposition 28, step 1 in [14| . we check hypothesis (HI) with 

a = max(|ao|, l a i|, l^ol' < 1 an( ^ with M the distribution of Z. Let fx\ = E \Z\ and 

H2 = E [Z 2 ] . We have (see [12]) 



/3 , 2aPP/(l -a) + (3 2 + a 2 

Ml = l = and u 2 = = , 

I- a I- a 2 

where a = E [oi], a 1 = E [a\] , /? = E [&i], /J 2 = E [6 2 ] , a/3 = E [ai&i] and ^ = E [s 2 ] . 
We then have the following deviation inequality for 9 n — 9. 

Proposition 3.1. For all 5 > 0, for all a > 0, for all b > and for all 7 > such that 
b < a/ {5 + 1) and 7 < min |ci/(l + $),c\/ ^1 + |, where c\ is a positive constant which 
depends onp\$, po, p\, \i\ and \i2, and for hq := (log (^ q 5 p b/co) /log a) — 1, we have 

• if ma < 1, then Vn G N, 

P (||0„ - 0|| > 5|PF > a) < c 2 exp (c"j q 6 p b) exp f-c' (7W) 2 (m 2 /2) n+1 ) + A n ; 

• if ma = 1, i/ien Vn G N, 
n - 0|| > <y|W > a) < c 2 exp (c"7^ p 6(n + 1)) exp (-c 1 (7W) 2 (m 2 /2) n+1 ) + A n ; 

if 1 < ma < v2j f/ien Vn G N suc/i f/iaf n > no, 

P (||0 n - 6\\ > S\W > a) < c 2 exp (-c' ( 7 <W) 2 (m 2 /2) n+1 ) + A„; 
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• if ma = y/2, then Vn G N such that n > no, 

¥ (\\6 n - 9\\ > S\W > a) < c 2 exp f-c' (7W) 2 (1/n) (m 2 /2)™ +1 ) + A n ; 

• if ma > v2, i/ien Vn G N* suc/i £/ia£ n > no, 
r n - 0|| > <f|W > a) < c 2 exp (-c' (7 9 5 p 6) 2 a" 2n ) + A„; 

w/iere A n = c 3 exp (V ( 7 W) 2/3 ) exp (-c" ( 7 W) 2/3 (t n /(n + l) 2 ) 1/3 ) 7 p G {1/2, 1}, g G 

{0, 1/2, 1} ; C2, C3 ; C4, c' and c" are positive constants which depend on c, m, a pio, po, Pi, 
fii and \x 2 . 

Remark 3.2. Note that the constants C2, C3, C4, d and c" which appear in Proposition 13.11 
may differ term by term. The values of p and q depend on the magnitude of 5 and 7. For 
example, for 5 and 7 small enough, we have p = 1 and q = 1. We also stress that all these 
constants can be made explicit by tedious calculations. 

4. Proofs of the main results 



4.1. Proof of Theorem 12. 21 Let / G Bb(S) such that (n, /) = 0. We are going to study 
successively Mh*(/) for M r = G r and EL- = T r . 

Step 1. Let us first deal with Mg*(/). By Chernoff inequality, we have for all 5 > and for 
all A > 



(/)><$)< exp(-AW)E 



exp A J] fiX,) 



i€G* 



(4.1) 



Recall that for all i G 

E [f(X 2i )l {2ieT * } + f(X 2i+1 )l {2i+le ^ } \T r -i] = mQf(Xi). 

By subtracting and adding terms in expectation of the right hand of (|4.ip . and conditioning 
with respect to J>_i, we get 



E 



x E 



exp [ A £ fiX,) 



E 



exp j A ^ niQf{Xi 



(4.2) 



exp ^2 A (/(^2i)l{2 ie T*} + /(^2i+i)l{2i+ieT*} - mQfiXi)) 



r-1 



Observing that ( G*_ 1 is T r -\ measurable, and using the fact that conditionally to J>_i, the 
triplets {(Aj),? G G r _i} are independent (this is due to the Markov property), we have 



E 



exp ^2 A (/(^2i) 1 {2ieT*} + /(^2i+i)l{2i+ieT*}-™<3/(^)) 



r-1 



(4.3) 



= n ■ 
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Using Azuma-Bennet-Hoeffding inequality [3], [5], |16j . we get according to (HI), for all 

i £ G*_ v 



E 



exp (A (f(X 2 i)l { 2ieT*} + /(^2i+i)l{2i+ieT*} - rnQf(Xi))) 



< exp 



c 2 X 2 (2 + maY 



From ()4,3p . this implies that 



E 



exp ( ^2 \(f(X 2 i)l { 2ieT*} + f(X 2i+ i)l{2i+ieT*} ~ m Qf( x i)) 

c 2 X 2 {2 + ma) 2 \G*_ 1 



< exp 



< exp 



c 2 \ 2 (2 + ma) 2 \G r - 1 \ 



where we have used the fact that |G*_]J < |G r _i| in the last inequality. Recalling 
are led to 



21), we 



E 



exp [ A £ f{X t ) 

ieGi 



< exp 



c 2 \ 2 (2 + may\G T -i\ 



xE 



exp 



Reproducing the same reasoning with Qf and G*_ x instead of / and G*, we get 

c 2 \ 2 m 2 (2a + ma 2 ) 2 |G r _ 2 1 



E 



exp { Am ^ Q/(AQ 



< exp 



xE 



exp Am 2 ^ Q 2 /(^ 



Iterating this procedure, we get 



E 



exp A £ f(Xi 



c 2 X 



< e xp — - (2a 9 + ma ?+1 ) m 2 '?-'-' 



E[exp(Am r Q r /(Xi)) 



< exp 



c 2 X 2 (2 + ma) 2 2 r - 1 ^ (a 2 m 2 ^ q 



E 

q=0 



x exp (Ac(am) r ) , 
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where the last inequality was obtained from (HI). From the foregoing and from (|4.ip . we 
deduce that 



(m g; (/)>*) < < 



/ . . _ c 2 X 2 (2+ma) 2 (2 r -( a 2 m 2 ) r ) 

exp[-A<5m + 2 (2-a*m*) 



x exp (Ac(am) r ) if a m 2 / 2, 
exp (-A<5m r + c 2 A 2 (2 + V2) 2 r2 r ~ 2 ) exp (Ac(\/2) r ) 

if a 2 m 2 = 2. 



Now, the rest divides into four cases. In the sequel c\ and c 2 will denote positive constants 
which depend on c, m, and a. 

• If ma < 1, then, for all r G N, (ma) r < 1 and 2 r — (a 2 m 2 ) r < 2 r . We then have 



P (M G * (/) > < exp (cA) exp (-A<5m r + A 2 ci2 r ) . 
Taking A = (5m r )/(2 r+1 ci) 1 we are led to 

P (m g *(/) > 5) < exp ( C1 S) exp (-5 2 Cl ) . 

• If 1 < ma < v2, then, since 2 r — (a 2 m 2 ) r < 2 r , we have 

P (Mg*(/) > *) < exp (-A<W + A 2 ci2 r ) exp (Ac(ma) r ) . 
Taking A = (<5m r )/(2 r+1 ci), we are led to 

Mg*(/) > 5) < exp (-c 2 ,5(m 2 /2) r (5 - 2ca r )) • 



For all r £ N such that r > log(5/4c)/ log (a), we have 5 — 2ca r > 5/2 and it then follows 
that 

f M G *(f) >s)< exp (-c 2 5 2 (m 2 /2) r ) . 



• If ma = \/2, then we have 

P (M G *(f) >5^j < exp (-X6m r + A 2 cir2 r " 2 )) exp (\c (V2 
Taking A = (5m r ) / '{c\r2 r ^ 1 ) , we have for all r > log(5/4c)/log(v / 2/''7i)j 

P (m g; (/) > 5) < exp (-c 2 <5 2 (l/r)(m 2 /2)'-) . 

• If ma > \/2) then we have 

P (M G *(f) >5^j < exp (-X5m r + A 2 ci(m 2 a 2 ) r ) exp (Ac(ma) r ) . 
Taking A = 5/(2ci(ma 2 ) r ), we have for all r > log(5/4e)/log a, 

P (Mg;(/) >S)< exp (-c 3 5 2 a" 2r ) . 
This ends the proof for M r = G r . 



DEVIATION INEQUALITIES FOR BIFURCATING MARKOV CHAINS ON GALTON- WATSON TREE 15 

Step 2. Let us look at Mj*(f). By Chernoff inequality, we have for all 5 > and for all 
A > 



(m** (/)>$) <exp(-X5t r )E 



exp A f( X i 



(4.4) 



Expectation which appears in the right hand of (|4.4p can be written as 



E 



exp [ A f( X i 

igT* 



E 



exp A f( X * 



(4.5) 



x exp j A (f + ™Qf)(Xi 



x E 



exp j A ^ {f(X 2 i)l{2ieT*} + /(^2i+i)l{2i+ieT*} - rnQf(Xi)) 



Observing that is -7>_i measurable, and using the fact that conditionally to Jy_i, the 

triplets {(Aj),i G G r _i} are independent and Azuma-Bennet-Hoeffding inequality, we obtain 



E 



exp j A ^ (/( x 2i) 1 {2ieT*} + /(^2i+i)l{2i+ieT*} - mQf(Xi)) 



J>_i 



exp (A (/(^2i) 1 {2iex*} + f(X 2 i + i)l{2 i+ i € T*} - mQf{Xi))) 



J>_i 



< exp 



< exp 



c 2 X 2 (2 + ma) 2lrr * 



r-l! 



c 2 (2 + ma) 



"r-l\ 



where the last inequality was obtained using the fact that < |G r _i|. From the foregoing 

and from (14.51). we deduce that 



E 



exp ( A f(*i 



x E 



< exp 



c 2 (2 + ma) 2 \G r -i\ 



<>x- P |A Y /(^))exp[A (f + rnQfXXi) 



Doing the same thing with (/ + mQf) and G*._ 1 instead of / and G*, we get 
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E 



exp[A /TO J exp [ A ^ (/ + mQ/)TO 



E 



exp A ^ /TO xexp A £ (/ + mO/ + m 2 Q 2 /)W 



iGT* 



x E 



exp I A ^ f (/ + mQ/)TO)l{2i G T*} + (/ + ^Q/)TO+i)l{ 2 i+ieT*j 



(mQf + m 2 Q 2 f)(X, 



< 



exp 



x E 



c 2 A 2 (2 + 3ma + m 2 a 2 ) 2 |G r _ 2 



exp A /TO xexp A £ (/ + mQ/ + m 2 Q 2 /)(A ; ) 



Iterating this procedure, we are led to 



E 



exp [ A f(Xi 



< exp 



q=l \k=0 / 



x E 



exp(A^mWTO) 

q=0 



< 



exp 



c 2 (2 + m«) 2 A 2 r f q ~' 



g=l \fc=0 



exp 



9=0 



ma) 



where the last inequality was obtained using hypothesis (HI). In the sequel, Co, c\ and c 2 
will denote some positive constants which depend on a, m, and c. They may differ from one 
line to another. For ma ^ 1 and ma 7^ \/2, we deduce from the foregoing and from 
that 



M r *(f) >S) <exp(-A<5t r .)exp 



x exp 



Ac(l - (ma) r+1 ) 
1 — ma 

c 2 (2 + ma) 2 A 2 / _ 2ma(2 r - (ma) r 



2(1 - ma) 



2 — ma 



+ 



(ma) 2 (2 r - (m 2 a 2 ) r ) 



< exp ( — X5t r + 



c 2 (2 + ma) 2 \ 
2(ma 



2 — (ma) 2 

0^ / (ma) 2 (r-(m 2 a 2 Y) 
l) 2 V 2 -(ma) 2 

Ac(l - (ma) r+1 ) 



x exp 



1 — ma 
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_, . , 5t r (ma — l) 2 

Taking A = — — - , we are led to 

C 2(2 + maf (> - 1) + (ma) 2 ( ! ( ^ Q } } ) 

P M T? / > 5 < exp ^ " ; 2 2 , r ,, 

\ 2cH2 + rna)^ (y - 1 + (ma) ^2 } ° ) 

(5(1 - ma) 2 t r 1 - (ma) r+1 

X eX P I -7 ^„^o, ,„2-2^N x 



— ma 



Now, the rest of the proof divides into five cases. 

• If ma < 1, then, for all r £ N, (raa) r+1 — 1 < ma — 1 and 2 r — (ma) 2r < 2 r . We then 
deduce that 

P (m t? (/) ><5) < exp (ca*) exp (-c 2 5 2 (m 2 /2) r+1 ) . 

• If 1 < ma < y/2, then we have 

P (Mt ; (/) ><5) < exp (- Cl <5 2 (m 2 /2) r+1 ) exp (c 2 6 ~ 1 

< exp (-5c 2 (m 2 /2) r+1 (5 - c a r+1 )). 

Now, for all r G N such that r + 1 > log(<5/2co)/log(a), we have 8 — coa r+1 > 5/2, in such a 
way that 

P (Mt*(/) > < exp (<5 2 c 2 (m 2 /2) r+1 ) . 

• If ma > a/2, then for all r G N, (m 2 a 2 ) r > 2 r . We then have 

P (m t .(/) ><j) < exp (-c 2 ,5a- 2r (5 - c a r+1 )) . 
Now for all r G N such that r + 1 > log(<5/co)/ log(a), we have 

p(m t .(/) >5) <exp' 



a 2r 



• If ma = 1, then 

P (m T; (/) > < exp (-Atftr + ci2 r A 2 ) exp (Ac(r + 1)) 
Taking A = 5t r / 'c\2 r+l , we are led to 

P (Mt ? (/) ><5) < exp (ci^^J^n^ exp (_ C2< 5 2 (m 2 /2)^ 1 ) . 

• If ma = y/2, then 

P (M T *(f) >5^j < exp (-XSt r + A 2 ci(r + l)2 r ) exp (Aci(V2) r+1 ) . 
Taking A = 8t r /(2ci(r + l)2 r ), we are led to 
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M T *(.f) >5) <exp 



r + 1 I 2 J I \ m 



Now, for all r 6 N such that r + 1 > log(<5/co)/log(v2/m), we get 

s:2 / 2 \ 

c 2 I m z \ \ 



M T? (/) ><5 <exp 



r + 1 



This ends the proof for 



TV. 



4.2. Proof of Theorem 12.31 Let / G Bb{S) such that (/i,/) 7^ 0. Once again, we are going 
to study successively M G *(/) and Mj*{f). 

Step 1. Let us first deal with M G *(/). Set g = f — (/i,/)- Then, = and 

M G *(/) = M G .(g) + (|G;|/m r )(^,/). 

We have 



;(/)-(/<.. mrxi) <p(m g? ( 5 )>V2 



(4.6) 



+P 



tf 



> 



2|(M,/>I 



As (fi,g) = 0, the previous computations (proof of Theorem 12. 2p give us some bound for 
the first term of right hand of (|4.6p . similar to those obtain in Theorem 12.21 Now, under 
hypothesis (H3), we deduce, from [2] Theorem 5, that 



\g; 



w 



m' 



> 



< c 2 exp ( -c 3 (5 2//3 m r//3 ) , 



2|(M,/)|. 

and this ends the proof of Theorem 12.31 when M r = G r . 

Step 2. Let us look at M T *(/). For / € B b {S), set g = f - (fJ,,f). Then, (fj,,g) = and 

M T *(f) = M T ,(g) + (\T;\/t r )(f,,f). 

We have 

'Mr t (f)-(^f)W>5) <F[M r *(g)>6/2) (4.7) 



+1 



|T*| 



> 



2|(m,/)i; • 



Since {/J>,g) = 0, the first term of the right hand of (|4.7p can be bounded as in the previous 
computations (proof of Theorem 12. 2p . Under additional hypothesis (H3), we have, from |2j 
Theorem 5, 



it: 



w 



> 



2| <)",/> I 



< 



P - 



(7=0 



W 



> 



9=0 



m 9 



IF 



2(r + l)|( M ,/)| 

St r 

> 



2{r + l)\(fi,f)\mi 
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= 2 exp (-^(j-J" 

q=0 \ 

< c 2 exp -c 3 5 2/3 



t 



(r + 1 



(r + \)m q 

1/3^ 

ol M l+o(l)), 



and this ends the proof of Theorem 12.31 when 



TV. 



4.3. Proof of Theorem 12.51 Let / G Bb(S). Without loss of generality, we assume that 
(/i, /) = 0. Otherwise, we take / — (fj,, /). For all 5 > 0, for all a > and for all b > such 
that b < a/ (5 + 1), we have 



(M H ,(/) ><5|W > a) = P ( M H *(/) > 5, ^ > 6|iy > 



+ P(M H; (/)> ,5,1^ <(,!»-> a 



w>^) p r H;(/)>i ' 



> > a 



+ P(M H? (/)><5,^<6,VF>a 



<p a P (M H *(/) ><5&) + p a P 
< Pa ¥ (M w (f)>5b) + Pa F 



W 



w 



> W - b, W > a 

> 5b 



where p a = P (W > a)~ . Now, the first term of the last inequality can be bounded as in 
Theorem 12. 2| and the second term is bounded as in the step 1 and and step 2 of the proof 
of Theorem 12.31 This ends the proof. 

4.4. Proof of Theorem \ZM Let / G B b (S 3 ). 

Step 1. Let us first deal with M<g*(/). Assume that (fj,,P*f) = 0. By Chernoff inequality, 
we have for all 5 > and for all A > 0, 



'(Mo* (/)>*) <exp(-AW)E 



exp A ^ /(A*) 



Conditioning by J>, and using, conditional independence of triplets {Ai,i G G r } with respect 
to J>, Azuma-Bennet-Hoeffding inequality and (H2), we obtain 



E 



exp A ^ /(A, 



E 







| E 


exp | 
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E 



exp ( A ^ P*f(X t ) J H E [exp (A(/(A<) - P*f(X i ))) 



i€G* 



<exp (2A 2 ||/|| ooCl m r )E 



exp A P*f{*i 



We control the last expectation as in the Step 1 of the proof of Theorem 12.21 apply to P*f. 
Next, we get the result discussing as in the proof of Theorem 12.21 
If (fi, P* f) ^ 0, we set g = f — (//, P* /). Then, we have 



.(f)-<jM,P m f)W>8) <P (M G »(g) >5/2 



(4. 



+P 



W 



m' 



>6/2\(»,P*f}\). 



The first term of the right hand of (|4.8p can be bounded as previously since (/i, P*g) = 0. 
The second term can be bounded as in Step 1 of the proof of Theorem 12.21 This ends the 
proof for M G; (/). 

Step 2. Let us now treat Mj*(f). First, we assume that (fj,,P*f) = 0. For all 5 > 0, we have 

Mxf. (/) > s) < F I 1 - P*f(X l )) > 6/2) + p(m t .(P*/) > 5/2 

By chernoff inequality, we have for all A > 0, 

p(i^(/(A,)-P*/(^))>V2J <ex; 



xE 



exp A^(/(A,)-P7(Xi)) 



ieTi 



Conditioning successively with respect to {J-q)o< q < r , using conditional independence of triplets 
{Aj, i £ G q } with respect to T q and applying successively Azuma-Bennet-Hoeffding inequality 
and the fact that |G*| < \G q \ for all q G {0, • • • , r}, we get 



E 



exp A£(/(Ai)-P7M) 



E 



E 



exp (A ^ (/(Ai) - P*/(Xi)) ) E 



exp A J3(/(Ai)-P*/TO) 



exp (A ^ (/(AiJ-P*/^))] n E [ ex P( A (/(Ai)-P7(^))) 
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< exp (2A Z 



|G r |)E 



exp A (/( A ^ " P *f( X i)) 



< exp (2A S 



|T r |). 



Next, optimizing on A, we obtain 



T r ^} f{Ai) " P * f(Xl)) > V2 ) " exp (~ Cl62 



for some positive constant c\. The term P yMj*{P* f) > S/2J can be bounded as in the 

proof of Theorem I2.2[ and this ends the proof when (fi,P*f) = 0. On the other hand, if 
{fj,,P*f) ^ 0, we have 

IT* 



M T *(f)-(n,P*f)W = M r *(g) + 



W\(vL,P*f). 



We then proceed as for (|4.8|) . and this ends the proof for Mj*(f). 

Step 3. Eventually, we bound P (~M m *(f) > 5 - (fi,P*f) > S) , using Step 1 and Step 2, 
as in the proof of Theorem 12.51 



4.5. Proof of Proposition [3TTT We are going to treat Sq — ao- Deviation inequalities for 
2™ — ai, {3™ — f3 v , a™ — a', — /3' v , r] G {0, 1}, can be treated in the same way. Recalling 
that the state space of the process (Xi,i £ T*), denoted by S, is assumed to be a compact 
subset of PL 

Let gi, g 2 , hi and h 2 the functions defined on S* 3 respectively by 

gi(x, y,z) = (xy - x(a x + /3 )) l s s(x,y, z), 

g 2 {x,y,z) = (y - a x - /3 )) l S 3(x,y,z), 

hi(x,y,z) = xl s s(x,y,z), 

h 2 (x,y,z) = x 2 l s3 (x,y,z). 

It is easy to see that P*g\(x) = 0, P*g 2 (x) = 0, P*h\{x) = piflX and P*h 2 (x) = piflX 2 where 
P* denote the transition kernel associated to the BAR(l) process with missing data. With 
these notations, we can rewrite 2q — ao as 



IT*!- 1 !^' ! 



a - a 



B, 



p>„ 



where B r , 



I tip* I — 1 1 ^1,0 1 



inr 1 E 

iGT* 



mr 1 x: ma* 
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Recalling (|l,8p . we then have for all 5 > and a > 

^I-^IIMt,^)! ^ 5 



\afi-ao\ >5\W>a) < 



\Bn\ 



> 2 



W> a 



+1 



|5 n | 2 

-l| T l,0i 



For the first term of the right hand of (|4.9p . since |T*| |T n ' | < 1, we have for all 7 > 



in- u nM T .( gi )i ^ 6 

\B„\ 2 



W > a] < F(\B n \ < j\W > a) 



+F(\M T * n ( gi )\> S -l 



W> a) . 



Notice that 



IT 



1,0 1 



Ti 



■Pi,o + 



IT 



1.0 1 



T* 



-Mr* (h 2 - pi,oA*2) 



(M T *(/ii -pi,oMi)) - 2pi,oMiM T *(/ti -pi.oMi) 



and 



{|5 n | < 7} C {|B n - (Pi, M2-Pi,oMi)l > bi,oM2-Pi, Mil -7}- 

2|p? M2-Pi,oM?l 



We then have for all < 7 < 



\Bn\ 2 



2 + 5 
W > a I 



<P(|B n | <7|W>a)+pf|M T .(si)| > ^ 



TV > a 



7<5 



e>7 



< P ( |B n - (pi >0 n 2 -pi >0 ri)\ > y W > a) + P MMt* (5i)| > -77 
<PMM T *( 5 i)| > y|^>a) +p(|M T ,(/ l2 -p 1 ,o/i 2 )| > y 



TV > a 



+ P(JM t ,(/h -pi l0 /ii)| > 
+ P(|M T .(/ii -pi,oMi)l > 



2^2 

7 S 



16pi,o/ii 
From 1131 . Section 5, we have 



U r > aj +P 
VF > a 



IT 



1,0 1 



T* 



Pi,o 



W>a 
£7 



> 



'1,0/^2 



7/ > a 



IT 



1,0 1 



T* 



Pifl 



> 



5-f 



'1,0^2 



VF > a 



T 



! ,T*I 



i=l 



> 



5-f 



'1,0M2 



VF > a 
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where {Tj)- >1 is a sequence of i.i.d. Bernoulli random variables such that 



Pl,0 



>(T, = 1) 



1 



•m = o). 



Lj — X, — _L U V _Lj 

To majorize the right hand side of (|4.10p . we use exactly the same ideas that for the proof of 
Theorem 12.51 and Step 2 of the proof of Theorem 12.21 



For the second term of the right hand of (I4.9P , we have 

Wf*{92)\ 



|A/ T .(fr 1 )||M T «(g 1 )| > 6 
\B„\ 2 



f \M ri {g 2 )\ VS 
\ \B n \ 2 



W > aj <1 
W > a | + 



\Bn\ 



> 



4pi ,oMi 



W > a 



|M T »(/ii -pi,oMi)l > 



W> a 



Now, the first and the second term of the right hand of the last inequality can be treated as 
the first term of the right hand of (|4.9p . 

Finally, to get the result, just apply Theorem 12.61 to functions gi, g 2 , hi and h 2 . 
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